Statistical thinking: From Tukey to Vardi and beyond

Author

  • Larry Shepp
Abstract

Data miners (minors?) and neural networkers tend to eschew modelling, misled perhaps by misinterpretation of strongly expressed views of John Tukey. I discuss Vardi’s views of these issues as well as other aspects of Vardi’s work in emission tomography and in sampling bias. Statistics is not in my main skill set, but I take this opportunity to record a few thoughts about it that I have accumulated over the years. In the 1960s, John Tukey and his followers brought exploratory data analysis into statistics, partly as a revolt against what was then perceived as the overly rigid and brittle mathematical modelling philosophy that held sway at the time. Some problems seemed to demand a purely data-driven approach, in which data mining methods, in the absence of mathematical modelling, are the driving methodology. One did not want to be biased by preconceived ideas about the origin of the data by formulating a model, but instead allowed the data to “speak for itself”. Vardi liked mathematical modelling and was very good at it. He also promoted data mining, depending on the problem, and thus straddled both philosophies. He and I often debated these issues and were often in friendly disagreement. I will try to argue, with concrete examples from the work of Vardi and others in statistics, that the pendulum should swing back a bit towards encouraging more mathematical modelling. Statistical procedures yield maximal benefit when physics, biology, and other fields of science enter the problem formulation via mathematical modelling of the specific statistical problem at hand. I would argue that the solution to a specific problem ought to somehow depend on the problem itself, which is not the case with neural nets and other data-driven approaches that live mostly or entirely within the data or training set of the problem.
Data-driven statistics risks isolating statistics from the rest of the scientific and mathematical communities by not allowing valuable cross-pollination of ideas from other fields. To illustrate these ideas I will discuss, among other concrete statistical problems, emission tomography, machine learning, and sampling bias. These topics were debated frequently by Vardi and me. I will do my best to give Vardi’s side as honestly as possible. Needless to say, I wish he were here to continue the debate. I will quote Tukey and/or Vardi on issues I will raise, and you should be aware that people who quote absentees don’t allow the quotees to modify the positions they are being quoted on unless they are in agreement with the ...

∗Research partially supported by National Science Foundation Grant DMS-0504387. Department of Statistics and Biostatistics, Rutgers University, Hill Center, Busch Campus, Piscataway NJ 08854, USA.

Similar resources

Demography and the Coronavirus Pandemic

Abstract. One of the most urgent policy issues related to the COVID-19 pandemic in Europe concerns the extent and ways in which demographics have determined different patterns of mortality between groups and regions, and whether and how the pandemic and its economic consequences will affect population dynamics in the future. Post-pandemic policy evaluations on the spread of COVID-19 and the im...

The Relation between Deterministic Thinking and Mental Health among Substance Abusers Involved in a Rehabilitation Program

Objective: The current research investigates the relation between deterministic thinking and mental health among drug abusers, in which the role of cognitive distortions is considered and clarified by focusing on deterministic thinking. Methods: The present study is descriptive and correlational. All individuals with experience of drug abuse who had been referred to the Shafagh Reha...

Do Critical Thinking Skills Lead to Success in Language Teaching? A Case of Iranian EFL Teachers Based on Their Gender and Degree of Education

The present study attempted to discover whether there is a significant relationship between EFL teachers' critical thinking ability and their teaching success. To this end, 113 Iranian male and female English teachers were required to fill out the Watson-Glaser Critical Thinking Appraisal. In addition, their students were asked to answer the Characteristics of Successful EFL Teachers questionnaire. The ...

Cyber Medical Education: Beyond the Integration of Concepts in Technology-based Learning

Introduction: Along with the transition from the digital era to the era of cyber-technology, medical professionals have been forced to use different conceptual systems to meet their informational and communicational needs. Each of these emerging scientific concepts has a specific meaning, which should be redefined in its own context so that they could be utilized in the conceptual systems of speci...

Thinking Out of the Box: A Green and Social Climate Fund; Comment on “Politics, Power, Poverty and Global Health: Systems and Frames”

Solomon Benatar’s paper “Politics, Power, Poverty and Global Health: Systems and Frames” examines the inequitable state of global health, challenging readers to extend the discourse on global health beyond conventional boundaries by addressing the interconnectedness of planetary life. Our response explores existing models of international cooperation, assessing how modifying them may achieve the...

Publication date: 2008